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Abstract — We study the half-duplex multiple-relay channel 
(HD-MRC) where every node can either transmit or listen but 
cannot do both at the same time. We obtain a capacity upper 
bound based on a max-flow min-cut argument and achievable 
transmission rates based on the decode-forward (DF) coding 
strategy, for both the discrete memoryless HD-MRC and the 
phase-fading HD-MRC. We discover that both the upper bound 
and the achievable rates are functions of the transmit/listen state 
(a description of which nodes transmit and which receive). More 
precisely, they are functions of the time fraction of the different 
states, which we term a schedule. We formulate the optimal 
scheduling problem to find an optimal schedule that maximizes 
the DF rate. The optimal scheduling problem turns out to be 
a maximin optimization, for which we propose an algorithmic 
solution. We demonstrate our approach on a four-node multiple- 
relay channel, obtaining closed-form solutions in certain scenar- 
ios. Furthermore, we show that for the received signal-to-noise 
ratio degraded phase-fading HD-MRC, the optimal scheduling 
problem can be simplified to a max optimization. 

Index Terms — Decode-forward (DF), half duplex, multiple- 
relay channel (MRC), scheduling, phase fading. 



I. Introduction 

In this paper, we investigate the D-node half-duplex 
multiple -relay channel (HD-MRC). The ZJ-node multiple-relay 
channel (MRC) models a network where a source transmits 
data to a destination with the help of {D — 2) relays, which 
themselves have no data to send. In half-duplex channels, a 
node can either listen or transmit but cannot do both at the 
same time. Analyses of half-duplex channels are appealing 
from a practical point of view as they model most transceivers 
available in the market today. 

Though much has been done to understand the full-duplex 
multiple-relay channel (FD-MRC) |1|-|3| in which all nodes 
can transmit and listen at the same time, its half-duplex 
counterpart receives less attention for it is often thought of as 
a special case of the full-duplex channel, where a node simply 
stops listening when it transmits and vice versa. However, 
we will show in this paper that half-duplex channels identify 
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issues, e.g., scheduling, that do not appear in the full-duplex 
counterpart. We will also show that the coordination of the 
transmitting and the listening modes among the relays plays 
an important role in determining the bottleneck of the half- 
duplex channel. 

For the half-duplex networks, the transmitAisten state is 
defined to describe which nodes are transmitting (or listening) 
at any time (by default, nodes not transmitting are considered 
to be listening). In the D-node HD-MRC, where the source 
always transmits and the destination always listens, the total 
number of transmit/listen states is 2^^^, capturing which of 
the {D — 2) relays are transmitting. 

In this paper, we first obtain an upper bound to the capacity 
of the HD-MRC by specializing the cut-set bound for the half- 
duplex multi terminal networks Q. We then use the idea of the 
decode-forward (DF) coding strategy for the FD-MRC |1| to 
derive a lower bound to the capacity (i.e., achievable rate) of 
the HD-MRC. We will see that the capacity upper bound and 
the achievable DF rates depend on the time fractions of the 
transmit/listen states, but do not depend on the sequence or the 
order of the states. This means we do not need to coordinate 
the transmit-listen sequence among the nodes to maximize 
the achievable DF rate or to find the capacity upper bound. 
We can view the combination of time fractions of different 
transmit/listen states as a schedule. We then formulate the 
optimal scheduling problem to find an optimal schedule that 
maximizes the DF rate for the HD-MRC. 

The optimal DF scheduling problem turns out to be a 
maximin problem, which is not easily solved in general. We 
propose a greedy algorithm to solve the general maximin 
problem. We then show how the algorithm can be simplified 
when applied to the optimal DF scheduling problem. Using 
the simplified algorithm, we obtain closed-form solutions for 
certain channel topologies. Furthermore, we show that for the 
received signal-to-noise ratio (rSNR) degraded phase-fading 
HD-MRC, the optimal scheduling problem can be further 
simplified to a max optimization. We compute DF rates of 
HD-MRCs and compare them to that of the corresponding 
FD-MRCs. We highlight some differences between the half- 
duplex and the full -duplex channels. 

The specific contributions of this paper are as follows: 

i) We obtain an upper bound to the capacity of the discrete 
memoryless HD-MRC and the phase-fading HD-MRC, 
by specializing the cut-set bound for the general half- 
duplex multi terminal network |4|. 

ii) We derive achievable rates for the discrete memoryless 
HD-MRC and the phase-fading HD-MRC. The coding 
scheme is constructed based on decode-forward (a cod- 



2 



ing strategy for the FD-MRC |]T|) and the idea of cycHng 
through all transmit/listen states in each codeword (an 
extension of that used for the half-duplex single-relay 
channel ||5)). Note that this result on achievable rates 
was independently derived by us, in an earlier confer- 
ence version of this work ||7|, and by Rost and 
Fettweis (8), ||9|. 

iii) We formulate the optimal scheduling problem for the 
phase-fading HD-MRC to find the optimal schedule that 
gives the highest DP rate, and propose an algorithm to 
solve the scheduling problem. 

iv) We show that for the rSNR-degraded phase-fading HD- 
MRC, the optimal scheduling problem can be simplified 
to a max optimization. 

v) We compare the DP rates of the HD-MRC to those of 
the PD-MRC. The bottleneck nodes in the PD-MRC 
depend on the channel topology. On the other hand, 
the bottleneck of the phase-fading HD-MRC is always 
constrained by the first B relays, where 1 < B < D — 2, 
or by all the relays and the destination. Por the case of 
rSNR-degraded phase-fading HD-MRC, the bottleneck 
is always constrained by all the relays and the destina- 
tion, which is independent of the topology. 

A. Related Work 

The Gaussian half-duplex single-relay channel (HD-SRC), 
i.e., the MRC with D = 3, was first studied by H0st- 
Madsen |5| who derived a capacity upper bound based on cut- 
set arguments and a capacity lower bound based on the partial- 
decode-forward relaying strategy where the relay decodes part 
of the source message. A year later Khojastepour et al. |10|, 
[ [TT) showed that partial-decode-forward achieves the capacity 
of the degraded (Gaussian and discrete memoryless) HD-SRC, 
a result that mirrors the full-duplex counterpart where decode- 
forward achieves the capacity of the degraded single-relay 
channel | [T2j . In the aforementioned work, the transmit/listen 
state for each channel use is fixed and known a priori to all 
nodes. In 2004, Kramer fl3\ showed that larger DP rates could 
potentially be achieved using random states because the choice 
of state could be used by the source and the relay to transmit 
information. 

Other strategies have been proposed for the HD-SRC, e.g. 
(i) compress-forward where the relay quantizes its received 
signals and forwards them to the destination 1 14|, and (ii) one 



where the relay decodes every second source message (15| 
(i.e., the relay toggles between the transmit and listen modes 
for alternate blocks of message transmission of the source). 
The fading HD-SRC has been studied by H0st-Madsen and 
Zhang | I4] and Rankov and Wittneben |16|. A hybrid scheme 
using full-duplex and half-duplex relaying was proposed for 
the single-relay channel by Yamamoto et al. |17) . 

Por the half-duplex multiple-relay channel (HD-MRC) 
(where D > 3), Khojastepour et al. considered the cascaded 
MRC where the source, the relays, and the destination form a 
"chain", and each node only receives the transmissions from 
one node (i.e., the upstream node in the chain). The capacity of 
the cascaded HD-MRC was derived for the Gaussian case |[TT| 



as well as the discrete memoryless case pO) . Another class of 
the HD-MRC where the relays only receive signals from the 
source has been studied |[T8)-||20|. 

The general HD-MRC where each relay (in its listening 
mode) can receive the transmissions of all other nodes was first 
studied by Kramer et al. [21, where they mentioned that DP 
for the FD-MRC can be extended to the HD-MRC. However, 
explicit expressions for achievable rates were not derived. In 
2008, in the conference version of this paper [6|, |7|, we 
studied this class of general HD-MRCs, and derived an upper 
bound to the capacity based on cut-set arguments and a lower 
bound to the capacity based on DP. In the same papers we 
introduced scheduling for the HD-MRC and algorithms to 
optimize the schedule to achieve the maximum DP rate. Inde- 
pendently in the same year, Rost and Fettweis derived capacity 
lower bounds for (i) the discrete memoryless HD-MRC ||8| 
using partial-decode-forward and compress-forward strategies, 
(ii) the Gaussian HD-MRC \9\ using partial-decode-forward 
and amplify-forward strategies. Their results on achievable 
rates include ours as a special casej^The scaling behavior of 
the HD-MRC, i.e., when the number of relays grow to infinity, 
was studied by Gastpar and Vetterli [IS]. 

In half-duplex networks, the question of scheduling arises, 
i.e., how the relays coordinate their transmit/listen modes. 
ScheduHng for the HD-MRC with two relays (i.e., D = 4) 
and where there is no direct source-destination link has been 
recently studied by Rezaei et al. [21). It was shown that 
when the signal-to-noise ratio (SNR) is low and under certain 
conditions, the optimal schedule spans over two states: (i) both 
relays listen, and (ii) both relays transmit. Por the high SNR 
scenario, the optimal schedule is such that at any time only 
one relay listens and the other transmits. In this paper, we 
study scheduling for the Gaussian HD-MRC for any number 
of relays assuming that DP coding strategy is used. We propose 
an algorithm to find the optimal schedule for DP. 

In the literature, scheduling for the relay network is also 
studied under other context, e.g., (i) resource allocation: al- 
locating time fractions for relays to forward data to different 
destinations [22J-[24j, and (ii) interference avoidance: avoid- 
ing simultaneous transmissions among nearby relays (graph 
coloring) 125), ||26). 

II. Channel Models 

A. Discrete Memoryless HD-MRC 

The FD-MRC was introduced by Gupta and Kumar p7) , 
and Xie and Kumar In a D-node MRC with nodes 
{1,2,...,_D} = T), node 1 is the source, node D the 
destination, and nodes 2 to (Z? — 1) the relays. A message 
W is generated at node 1 and is to be sent to node D. In 
this paper, we consider the HD-MRC where a node can only 
transmit (T) or listen (L) at any point in time. We assume 
that the source is always transmitting, and the destination is 
always listening. 



'in contrast to this paper, optimization of states, or sclieduling, was not 
considered by Rost and Fettweis Jsj, j9j. 
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Fig. 1. The HD-MRC in state s = (L, . . . , T) where node 2 listens and 
node (D — 1) transmits 



We use a transmit/listen state vector s to capture the half- 
duplex scenarios among the relays. For the _D-node HD- 
MRC where the source always transmits and the destina- 
tion always listens, the state can be expressed as s = 
(s(i),...,sW,...,s(^-2)) €E {L,T}^-2, where the i-th 
element of s is T if node + transmits, and is L otherwise, 
i.e., if node listens. We will use the terms transmit/listen 

state and state interchangeably in the rest of this paper. In this 
paper, we only consider the case where the state is fixed and 
known a priori to all nodes. 

We define the set of all relays as 7?. = {2, 3, — 1}. 

In state s, we further define C{s) as the set of relays that are 
listening and T(s) as the set of relays that are transmitting, i.e., 
Cis) ^{i&TZ: = L} and r(s) ^ {i £ TZ : s('-i) ^ 

T}. Note that £(s) n r(s) = and C{s) U Tis) = U. 

Throughout this paper, for each random variable denoted 
by an upper letter X, its realization is denoted by the cor- 
responding lower letter x. We use the calligraphic font to 
denote sets, and bold letters to denote vectors. For example, 
X = {X[l],X[2],...,X[n]), and for S = 62, • • ■ , 

— {^bi , , . . . , Xb|g| }. 

We denote the input from user i to the channel by Xi E Xi, 
and the channel output received by user i by G . When a 
node i is listening, i.e., not transmitting, we assign a dummy 
symbol as its input to the channel, Xi e Xi, where Xi is fixed 
and is known a priori to all nodes. Similarly, when node i is 
transmitting, we assign iji G yi to be its (dummy) received 
channel output, where is also fixed and known a priori to 
all nodes. For example, in the Gaussian half-duplex channel 
where channel inputs and outputs are taken over the set of real 
numbers, we can set Xi — if node i is listening or iji = if 
node i is transmitting. 

The D-node HD-MRC in state s = {L, s^^\ . . . , s^^-^\T) 
is depicted in Fig. [T| In this state, node 2 is listening and node 
{D — 1) is transmitting. 

Since the HD-MRC is an instance of the FD-MRC in a 
certain transmit/listen state, the channel distribution of the 
HD-MRC follows the underlying FD-MRC. Let the channel 
distribution for a D-node FD-MRC be 



p*(2/2,y3, 



,yD\Xl,X2, 



,XD~l) 



on 3^2 X 3^3 X ••• X yo, for each {xi,X2, ■ ■ ■ ,xd-i) e 
A"! X A'2 X • • • X Xd^i- The channel distribution for the 
corresponding D-node HD-MRC in state s is given by 



P{y2,y3, ■ ■ ■,yD\xi,X2, . . .,Xd-1,s) 

^p{yc(s),yD\xi,x2,-.-,XD-i,yT(,s)) Yi ^iv^^Vi)^ 



(1) 



where 



v{yc(s),yD\xi,x2, ■ ■ .,XD-i,yris)) 

P*(y2, 2/3, ■■■,yD\xi,X2,-.-, XD^l) 



(2) 



^VC(s),VD -P*(y2,y3,- ■ .,yD\Xl,X2, ■ ■ -^Xd-I 

5{A) is the indicator function that is 1 if the event A is true, 
and is otherwise. The expression in Q forces that with 
probability equals one, node z's received signal will be iji if 
it is transmitting. In this paper, we only consider memoryless 
and time-invariant channels. 

B. Block Codes and Achievable Rates 

In the MRC, the information source at node 1 emits a 
random messages W E {1, 2"-''} = W, which is to be 
sent to the destination, node D. We consider block coding 
with n uses of the channel. An (2"^, n) code for the D-node 
HD-MRC comprises the following: 

i) (source) An encoding function at node 1, /i : >V — ?> 
Xi, such that Xi = fi{W), which maps a source 
message to a codeword of length n. Here Xi = 
(Xi[l],Xi[2], . . . where Xi[t] is node I's input 
to the channel at time t. 

ii) (relays) n encoding functions at each node i £ 
{2,..., i?- 1}, 3;*-! ^ A-,, for alHe {!,...,«}, 
such that X,[t] = /,,t(K,[l], K,[2], . . . , Y,[t - 1]), which 
map past received signals to the next transmit signal. 

iii) (relays) The restriction that at the t-th channel use in 
state s, the transmit messages of the relays that are 
listening are restricted to Xi[t] = Xi, Vz £ C{s). 

iv) (destination) A decoding function at node D, go : 
y^ —7- W, such that W — goi^ d)^ which maps n 
received signals to a source message estimate. 

On the assumption that the source message W is uniformly 
distributed over W, the average error probability is defined 
as Pe = Pr{VF ^ W}. The rate R is achievable if, for any 
e > 0, there is at least one (2"^,n) code such that Pg < e. 
The capacity C is defined as the supremum of the achievable 
rates. 

C. Phase-Fading HD-MRC 

Now, we define the phase-fading HD-MRC. We set Xi = Q 
for nodes that are listening, and yi — for nodes that are trans- 
mitting. In state s, the received signal at node k £ {2, . . . , D} 
is given by 

f YI V\'ke'''^''''^X, + Zk, iffc€£(s)U{D} 
Yk^ I ie{i}uT[s) 

I yfc = 0, otherwise, i.e., \f k £ T{s), 

(3) 
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where each Xi is a complex random variable, Zk is the 
receiver noise at node k and is an i.i.d. circularly symmetric 
complex Gaussian random variable with variance E[ZkZl] ~ 
Nk, where zl is the complex-conjugate of Zk, Xi.k captures 
the path loss from node i to node k and is kc?^^' for dj > 1, 
and is k otherwise, di,k > is the distance between nodes i 
and A;, 77 > 2 is the attenuation exponent (with 77 = 2 for free 
space transmission), k is a positive constant, e-'^*^''*^) is the 
phase-fading random variable, 9{i,k) is uniformly distributed 
over [0, 2tt), and d{i, k) are jointly independent of each other 
for all channel uses, for all i, and for all k. 

We assume that all nodes know k and di,k for all node pairs. 
We also assume that node k only knows 9{i,k) for all i, and 
does not know any 9{i,l) for I 7^ k. Hence, the transmitted 
signals of node i cannot be chosen as a function of 9{i, k) for 
any k. 

In this paper, we consider the following individual-node per- 
symbol transmitted power constraint. Setting the half-duplex 
constraints Xi ~ Q for node i in the listening mode, we get 



< 



if i eT{s)U{l} 
otherwise, i.e., if i e C{s). 



(4) 



III. Upper Bound TO THE Capacity 



A. Capacity Upper Bound for the Discrete Memoryless HD- 
MRC 

An upper bound to the capacity of the HD-MRC is given 
in the following theorem. 

Theorem 1: Consider the discrete memoryless Z?-node HD- 
MRC. The capacity C is upper bounded by 

C < sup min 

^ p{s)l{xi,XQf^r{sy-,YQaf^c{s)-,YD 

X-Q^nTis) 1 Xc{s) = S:c{s) 7 ^Q<=nr(s) = yQTiTis) ,S^s 

(5) 

where p{xi,X2, . ■ . ,xd-i, s) = p{s)p{xi,xr(s)\xc{s), s) 
Y\kec{s) ^i^k — ik), = 72. \ Q, 72. is the set of all relays, 
T(s) is the set of all relays that are transmitting in state s, 
and C{s) is the set of all relays that are listening in state s. 

The aforementioned upper bound is obtained from the 
cut-set bound for the half-duplex multiple-source multiple- 
destination network |4|, which is a generalization of the 
cut-set bound for the full-duplex multiple-source multiple- 
destination network |28 Theorem 15.10.1]. Consider all cuts 
of the network separating {1} U Q from Q"^ U {D}, for all 
possible selections of relays Q C TZ. We see that the source 
is always on one side of the cut with the destination always 
on the other side. For simplicity, we call the set {1} U Q the 
left side of the cut and Q'^ U {D} the right side of the cut. 
An upper bound to the rate can be obtained by considering 
the maximum achievable rate across the cut, from the left to 
the right, assuming that all nodes on each side of the cut can 
fully cooperate. More specifically, consider any cut, the rate 
of message flow from the source to the destination (across 



the cut) is upper bounded by the mutual information between 
the inputs from all transmitting nodes on the left side of the 
cut and the outputs to all listening nodes on the right side, 
conditioned on the inputs and outputs of all transmitting nodes 
on the right side, and the inputs from all listening nodes (which 
is known a priori to all nodes). Hence, any achievable rate must 
be upper bounded by that for all the cuts (i.e., those with the 
source on the left side of the cut and the destination on the 
right side of the cut). The proof of Theorem [T| can be found 
in Appendix |A] 

B. Capacity Upper Bound for the Phase-Fading HD-MRC 

For the phase-fading HD-MRC, we set the inputs from 
transmitting nodes to the channel to be independent Gaussian. 
Note that channel inputs that take advantage of coherent com- 
bining are not feasible because the phase changes randomly 
for each channel use and is unknown to the transmitter. So, 
we get the following capacity upper bound. 

Theorem 2: Consider a Z3-node phase-fading HD-MRC. 
The capacity C is upper bounded by 



C < max min r 

p{s) Qcn 



(S) 



(6) 



where 



se{L,T}o-2 



p{s) 



\ fee(Q'=n£(s))u{r>} 



Sie{i}u(Qnr(s)) ^i.kPi 



(7) 



where r{x) — log2(l + x). 

Proof for Theorem^ Theorem |2] follows directly from 
Theorem[T]by using independent Gaussian inputs for all nodes. 

■ 

Remark 1: See fl4\ Lemma 1], ||2[ Theorems 6 & 7] for 
the optimality of independent Gaussian inputs in phase-fading 
channels. 

IV. Lower Bound to the Capacity 

A. Achievable DF Rates for the Discrete Memoryless HD- 
MRC 

Now, we present achievable rates for the discrete mem- 
oryless HD-MRC using DF. For a chosen probability mass 
function of the states p{s) and a fixed input distribution for 
each state p{xi, 0:2, ... , xd-i\s), we define the reception rate 
of node z G {2, 3, . . . , Z)} as follows: 



sG{L,T}°~2 
s.t. iG£(s)U{D} 



,i-l}nT{s) 



X{i,,,,^D-i}nT(s),Xc(s) = S:c(s),S = s 



(8) 



The reception rate of node i is the rate at which node i can 
decode the source message using DF. The following rate is 
achievable on the discrete memoryless HD-MRC. 
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Theorem 3: Consider the I?-node HD-MRC. Rates up to 
the following value are achievable: 



i?DF = max max min 

p{s) p{xi,X2.,...,xr,-l\s) ie{2,3,...,D} 



(9) 



for any p{xi,X2, . ■ . ,xd^i\s) = Ilkec{s)^(^k = 
S:k)pixi,Xf(^s)\xc{s)i^)7 where r,; is defined in 

Remark 2: The first maximization in (|9| is taken over all 
possible schedules, i.e., the probability mass functions of the 
states. The second maximization is taken over all possible 
input distributions for each state. The minimization is taken 
over the reception rates of all the relays and the destination — 
using DF, each relay and the destination must fully decode the 
source message. Note that the maximum DF rate only depends 
on the fractions of the states (i.e., the schedule), and does not 
depend on the sequence of the states. 

The aforementioned achievable rates are derived where the 
sequence in which the source message is decoded at the nodes 
is 1,2,...,D. This means node i decodes from all nodes 
with indices smaller than i (we term these nodes the upstream 
nodes), and transmits to all nodes with indices larger than i (we 
term these nodes the downstream nodes). In general, different 
decoding sequences give different DF rates. Hence, one can 
optimize (|9]l over all possible sequences with the constraint 
that node 1 being the first node and node D being the last 
node in the sequence | [29| . 

Sketch of Proof of Theorem [Jj Here, we briefly sum- 
marize the proof of Theorem |3] Details are given in Ap- 
pendix |B] We use the idea from DF for the FD-MRC by Xie 
and Kumar 1 1 , Section IV.2], which uses regular encoding 
and sliding-window (simultaneous) decoding technique. Block 
coding is used, and in each block of n channel uses, a 
node's transmit signals are functions of its newly decoded 
message and of its previously decoded messages that its 
downstream nodes are forwarding. Decoding of a message, 
as the term window-decoding implies, happens across a few 
blocks. A node decodes a message from all upstream nodes 
using a few blocks of received signals. For the construction 
of the codewords, we use the idea from the HD-SRC, where 
each codeword is split into two parts (in the first part, the 
relay listens; in the second part, the relay transmits) (51. 
Extending this idea to the MRC (assuming that all the nodes 
are synchronized), in each block, we cycle through all different 
states of the nodes' transmit/listen modes. In each codeword (a 
block of n channel uses), if a node only listens in n' (n' < n) 
channel uses, the rate at which it can decode the message will 
be reduced (compared to the full-duplex coding) accordingly. 
Furthermore, among the n' received signals, not all upstream 
nodes are transmitting. This means that a node will only get 
signals from transmitting nodes in these n' channel uses, and 
the rate at which it can decode the message can be determined 
accordingly. This is the reason why the order of the states does 
not affect the achievable rates, but the schedule does. 

Remark 3: Note that the achievable rate in Theorem |3] is 
different from simply taking the time-sharing average of the 
DF rates for the FD-MRC |1 Theorem 3.1] over all possi- 
ble transmit/listen states. Consider the single-relay channel 
(i.e., D = 3) as an example. The DF rate for the full- 



duplex case is Rip = min{/(Xi; ^21^2), /(^i, ^2; ^^s)}- 
For its half-duplex counterpart, there are two possible states: 
S = L cind S = T. When the relay listens [S = 
L], we have X2 = X2, meaning that I{Xi;Y2\X2) = 
I{Xi;Y2\X2 = i2) and I{Xi,X2;Y3) = HX.^Ys). So 
Rdf[s=l] = min{/(Xi;r2|X2 = is), /(^i; l^s)}- When the 
relay transmits [S — T], we have Y2 — y2, meaning that 
I{Xi;Y2\X2) = 0. So i?Dp[s^y] = 0. Denoting a = p{S = 
L) and taking the time-sharing average of these two DF rates, 
we obtain the following rate on the HD-MRC: 

"-^DF[S=L] + (1 - ^)^D¥[S=T\ 

= xmn{aI{X^-Y2\X2^i2),o^I{Xi;Y^)). (10) 

The aforementioned rate can only be lower than the DF rate 
obtained in Theorem |3] (using the same chosen a), i.e.. 



\Y^\X2 



X2 



min |q;/(Xi 

aliXi^Ys) + (1 - a)I{Xi,X2;Y3)y (11) 

Remark 4: In this paper, we only consider the "full" 
decode-forward strategy where all relays must fully decode 
the source message, and that imposes a constraint on each 
relay. Higher rates could possibly be obtained if we only use 
a subset of the relays to perform decode-forward (see, e.g., 
||3], 1 19 1). Of course, omitting some relays also means that we 
cannot use these relays to send data to the downstream relays 
and the destination. Other papers have considered (i) using 
the relays that do not decode the source message to perform 
compress-forward operations |8J, |9j, and (ii) having the relays 
decode only parts of the source message JS), 19), p9). 



B. Achievable DF Rates for the Phase Fading HD-MRC 

For phase-fading channels, we have the following achiev- 
able rates: 

Theorem 4: Consider the phase-fading D-node HD-MRC. 
Rates up to the following value are achievable: 



i?DF — max min 

p(s) i(E{2,3,...,D} 



where 



E 



s.t. ie£(s)u{D} 



}u({2,....»-i}nr(s)) 



(12) 



^k.iPk 



(13) 

which is the reception rate of node i. 

Proof for Theorem^ The rate is obtained from The- 
orem |3] using independent Gaussian inputs (see Remark [TJ. 



C. The Optimal DF Scheduling Problem 

We define a schedule and an optimal DF schedule of the 
HD-MRC as follows. The schedule for a half-duplex network 
is defined as the probability mass function of all possible 
transmit/listen states, i.e., p{s). An optimal DF schedule is 
the schedule that gives the highest DF rate. 
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Now, we formulate the optimal scheduling problem for the 
TD-node phase-fading HD-MRC. An optimal schedule is a 
probability mass function p*{s) on {L, T}^^^ that maximizes 
Rdf, i.e., 



p* (s) G argmax min 

ie{2,3,...,D} 



E 



s.t. iec{s)u{D} 



p{s)r\ 



Sfcg{l}U({2,...,t-l}nr(s)) ^kgPk 

N,, 



In words, the aforementioned function says that if p is 
a local maximum for the set B, we cannot simultaneously 
increase all {Ri{-) : i E B} by "moving" p in any direction 
in g. 

Now, we establish a necessary condition for the optimal 
solution to {T5\ . 

Lemma 1: Let the solution to ( [T5| ) be R* . There exists a set 
B = {bi,b2, ■ ■ ■ ^ bs} C {1,2,..., K} and an optimal p* e 
argmaxpgg min,jg{i,2,...,K} such that 



(14) 



V. Solving Optimal Schedules for the HD-MRC 
A. An Algorithm for General Maximin Problems 

From the previous section, we know that the optimal DF 
scheduling problem for the phase-fading HD-MRC is a max- 
imin optimization problem. In this section, we propose an 
algorithm to solve a general class of maximin optimization 
problems. We will show how this algorithm can be simplified 
when used to solve the DF scheduling problem for the phase- 
fading HD-MRC. 

Consider the following maximin optimization problem: 



max min {Riip)\, 
peg ie{i,2,...,K} 



(15) 



where p = (po,Pi, • ■ ■ ,Pm-i) is a point in the closed convex 
set Q in TZ^^ , and each Ri{p) is a real and continuous function 
of p. Both M and K are finite. Our aim is to find the optimal 
p* and R* given as follows: 



p* G argmax min Ri{p) 

peg ie{l,2,...,K} 



(16a) 



R* = max min Ri(p) — min Ri(p*), (16b) 
peg ie{i,2,...,K} ie{i,2,...,K} 

i.e., we want to find an optimal p that attains ( [T5] l, denoted 

by R*. 

Solving the aforementioned maximin optimization problem 
corresponds to finding the optimal schedule and the best DF 
rate considered in this paper In this application, Q becomes 
the set of all feasible schedules, p is a schedule from Q, Ri (p) 
is the reception rate for node p* is an optimal schedule, 

and R* is the corresponding (maximum) DF rate. 

For Pi,P2 G G, we denote by \p-^ ^ P2I ~ 

V (Pl,0 - P2,o)^ + [Pl.l - P2,lY H h {Pl.M-l - P2,M-lY 

the Euclidean distance between Pi and 

Definition 1: For a set S = {61, 62, ■ • • , ^s}, we define the 
set of arguments of local maxima (denoted by arglmax) as 

p' e arglniax{i?;,j (p) = iJ^^ (p) = • • • = R^^ (p)}, (17) 
peg 



iff 



R,{p')^R,{p'), ^i,jeB, (18) 



and if there exists some positive e such that for any p E Q 
where |p — p' | < e, there exists at least one index i E B such 
that 

R^ip) < R^{P')■ (19) 



p* e arglmax{i?bi (p) = Rb,{p) = ■ ■ ■ = Rb^ (p)} (20) 
p 

R* ^ R,{p*) < Rj{p*), yieB,yj(^B. (21) 



Proof of Lemma [7} By definition, R* takes the value of 
the minimum of {Ri{p*) : i e {1,2,..., K}} for some p*. 
We let the set of all indices {i} where Ri{p*) equal R* be B. 
This means we can always write R* — Rb-^{p*) — Rb^ip*) = 
■■■ = RbniP*) for some p*, and R* < Rj{p*), Vj ^ B. So, 
pT| ) must be true. Recall that Ri{p) is a continuous function 
of p. Now, if ( |20] i is false, we can always increase all {Ri{-) : 
i £ B} simultaneously by some small positive amount by 
using a new p' — p* + e, where e is some length-M vector 
with |e| being small, such that minigg Ri{p') = R' > R* and 
R' < Rj[p'), Vj i B. This means if ^ is false, R* cannot 
be the solution (contradiction). ■ 

For p* and B in Lemma [T] that attain the solution to ( [T5] l, 
we call the indices {61, 62, ■ • • , &s} the bottleneck indices of 
the optimization problem. 

Now, we present an algorithm to find the optimal p* and 
R* . This is a greedy algorithm that searches through all local 
maxima that satisfy the necessary condition in Lemma [T] 

Algorithm 1 (for the general maximin problem): 

i) Initialize B = Q. 

ii) Set B ^ B + I. For each 
subset/selection B = {61, ^2, • • ■ , ^s} ^ 
{1,2, . . . , A'}, find all p' e G such that 



p' e arglniax{ i?{,^(p) = Rb^{p 
peg 



■■■ = Rbs{p)} 
(22) 

R{B,p')^RbAp')<Rj{p'), ^3iB. (23) 



Repeat Step [ii] until \B\ ^K. 
iii) The optimal solution is given by the 
largest R{B,p') and its corres- 
ponding p' . 



B. A Simplified Algorithm for the Phase-Fading HD-MRC 

Now, we show that the algorithm proposed in the previous 
section can be simplified when used to solve the optimal 
scheduling problem for the phase-fading HD-MRC. Recall that 
the optimal scheduling for the D-node channel is to find p* {s) 
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that attains 



max min 

p(s) ie{2,3,. ..,£>} 



E 



i.t. iGC(s}u{D} 



p{s)L 



SA;e{l}u({2,...,i-l}nr(s)) ^k,iPk 



(24) 



In the previous section, we have an algorithm to solve p that 
attains 

max min Riip), (25) 

which is exactly the optimization problem in ( p4j l with the 
following substitution: 



• P = (j50,Pl ■ • • ,P20-2-l), 

where each pj is uniquely mapped from each p{s). 



• K = D-\. 



(26) 

(27) 
(28) 



all states s 
i.t. node listens in state s 



p{s)T 



E 

State I 
L) lisi 

/ ^ 

all nodes 

V 



all nodes k<.i 
s.t. z transmits in state s 



J 



(29) 



meaning that Ri{p) is the reception rate of node {i + 1), i.e., 

ri+i- 

We further define the following: 

Definition 2: For a set S = {&i, ■ • • , ^b}. we define the 
following global maximum function: 



p' e argmax{i?{„ (p) = i?^^ (p) = • • • = i?^^ (p)}, (30) 
pee 



iff 



p' e argmax i?hj(p), 
pee 



subject to the constraint that 

i?,(p') = i?,(p'), Vi,je6. 



(31) 



(32) 



Now, we will show that when applied to the phase-fading 
HD-MRC, Algorithm [T] can be simplified to the following: 

Algorithm 2 (for the phase-fading HD-MRC): 

i) Initialize B = Q . 

ii) Set B ^ B + 1. 

• If there exists some p' ^ G such 
that 

p' e argmax{i?i(p) = i?2(p) = • • • = -Rb(p)}, 
pee 

(33) 

i?i(p') <i?,(p'), Vj^S, (34) 



go to Step |iii| . 
• Else, repeat Step |ii| . 
iii) The algorithm terminates with 

an optimal schedule p* = p' and 
i?DF-i?l(p')- 



Comparing Algorithms [T] and [2] we need to prove the 
following for the latter: 



i) 



ii) 
iii) 



In Step [n] of the algorithm, instead of considering every 
possible size-B subset of {1,2,..., K}, we only need 
to consider {1, 2, . . . , B}. 

Instead of considering local maxima in ( p2) i, we only 
need to consider the global maximum in ( |33| l. 
We can stop the algorithm when a solution for ([33| and 
p4l ) is found, i.e., we do not need to proceed to a larger 
B if the terminating conditions are satisfied. 
The proof can be found in Appendix [C] This also proves 1 30 
Conjecture 1]. 

Remark 5: Comparing Step [n] in Algorithms [1] and [2] we 
need to consider (2^ — 1) combinations of B in the first 
algorithm for the general maximin optimization, but only need 
to consider at most K combinations in the second algorithm 
for the phase-fading HD-MRC, where K = D - 1. 

Remark 6: The set B that gives the solution corresponds 
to the set of nodes whose reception rates constrain the overall 
rate i?DF- We see that the bottleneck nodes for the phase-fading 
HD-MRC are always the first B relays, or all relays and the 
destination for the case of B = D — 1. This means it is always 
the "front" nodes that constrain the overall DF rates. 

In Appendix [P} we give an example showing how we can 
find an optimal schedule for the four-node phase-fading HD- 
MRC using Algorithm |2j and obtain closed-form solutions 
under certain channel topologies. 

C. A Further-Simplified Algorithm for the rSNR-Degraded 
Phase-Fading HD-MRC 

We further simpUfy Algorithm |2] for the received signal-to- 
noise ratio (rSNR) degraded phase-fading HD-MRC, defined 
as follows. 

Definition 3: A D-node MRC is said to be rSNR-degraded 
if and only if for all fee {1,2,..., D} where i < j < k, 
we have 



> 



Nk 



(35) 



In an rSNR-degraded channel, as the node index k increases, 
the SNR from each lower index node to node k becomes 
smaller 

Remark 7: The rSNR-degraded MRC is different from 
the degraded MRC defined by Xie and Kumar fD, Def- 
inition 2.3]. In the degraded MRC, {Xi, . . . , Xk-i) 
(Yk,Xk,...,XD-i) (Yfc+i,...,yD) forms a Mai'kov 
chain for each k. Applying this to the AWGN channel, 
roughly speaking, node (fc + 1) receives the signals from nodes 
{l,2,...,fc — 1} through Y^. We see that the degraded channel 
is defined with respect to the channel from all upstream nodes 
(collectively) to a node, while the rSNR-degraded channel 
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is defined for node pairs. Take D = 3 for example. An 
example of the degraded condition is Y2 — -v/Ai^e-'^'^^'^^Xi + 




yA2,3e-'''(2,3)jj^2 + Z3, where Ai 3 and 6''(1,3) are known to 
node 3. The rSNR-degraded condition simply requires that 



N2 ^ ^3 ■ 

For the rSNR-degraded phase-fading HD-MRC, the optimal 
scheduling problem can be found using the following algo- 
rithm. 

Algorithm 3 (for the rSNR-degraded phase-fading HD-MRC) 

Find p* that attains 

max{i?i(p)=i?2(p)-----i?K(p)}. (36) 

Proof for Algorithm^ Suppose that an optimal schedule 
and i?DF are given by 

• - Rb{p)} 

(37) 
(38) 



p* e max{i?i(p) = i?2(p) 
p 



RDF = RBiP*)<RjiP*),yj(^B, 

for some 6 = {1, 2, . . . , 5} C {1, 2, . . . , K}, i.e., B < K. 

From the proof for Algorithm [2j we have established that 
node (B + 1) must always listen in the optimal schedule. So, 
the reception rate of node (B + 1) is 

rs+i = Rb{p*) 



E 

all states s 



pis)T 



all nodes 
s.t. i transmits in state s 



B+l 



V 



(39) 



Now, the reception rate of node (B + 2) is 

rB+2 = (40a) 
/ E A,,B+2PA- 

all nodes i<B 1 
/ \Tn S.t. i transmits in state s 



all states s 



B+2 



V 



< Rb{p*)- 



J 

(40b) 
(40c) 



Eqn. ( |40b| l is derived because node (B + 1) always listens; 
so nodes (B + 2) and {B + 1) both decode from the same 
set of nodes in each state. An equality is achieved when node 
(B + 2) also always listens in all states. Eqn. ( |40c| l follows 
from the definition of rSNR-degraded MRC. 

So, condition ( (38] l cannot be satisfied for any B C 
{1,2,...,K}, and therefore B < K will not give an op- 
timal schedule. This means the only possible case is S = 
{l,2,...,if}. ■ 

Remark 8: Compared to the original maximin optimization 
in ( [T4] l, we only need to solve the max optimization in ( |36l ) 
to obtain an optimal schedule for the rSNR-degraded phase- 
fading HD-MRC. 

Remark 9: For the rSNR-degraded phase-fading HD-MRC, 
the bottleneck nodes are always all the relays and the des- 
tination, regardless of the network topology. This means in 
an optimal schedule, the relays coordinate their transmit/listen 
states so that no single relay or a subset of relays will constrain 
the overall DF rate. 

A note on complexity: In Algorithm [3] we only need to 
consider one case for B, compared to (2^~^ — 1) cases in 
Algorithm [1] However, equating the equalities in ( [36] l has a 
time complexity of 0(2^~^), where 2^~^ is the length of the 



vector p. A heuristic algorithm 1 3 1 1 can be used to reduced the 
dimension of p, but doing that does not guarantee optimality. 

VI. Numerical Computations 
A. The Four-Node Phase-Fading HD-MRC 

In this section, we demonstrate that Algorithm [2] provides 
insights into how one would operate the network. We consider 
the four-node phase-fading HD-MRC where the nodes are 
placed on a straight line. The node coordinates are as follows: 
node 1 (0,0), node 2 (0,?;2), node 3 (0,2/3), and node 4 
(0, 100). We set the channel parameters to be k = 1, 77 = 2, 
and fix the per-symbol power constraints to be Pi — 10 for 
all i E {1,2,3}, and the noise variance at the receivers to 
be Nj = 0.01 for all j € {2,3,4}. Define po = p{L,L), 
Pi = p{L, T), p2 = p{T, L), and p^ = p{T, T). 

Fig. [2a] shows DF rates for the four-node phase-fading HD- 
MRC with varying relay positions using Algorithm |2] and 
Fig. 2b shows the capacity upper bound. Note that at any 2/2 
and 2/3, the DF rate is strictly below the capacity upper bound. 

Fig. 3a shows which case in Algorithm |2] — whether \B\ is 
1, 2, or 3 — corresponds to the optimal schedule, for the node 
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MRC: (0,0), (0,)/2), (0,?y3), (0,100) 
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Fig. 3. Optimal schedules for the four-node phase-fading HD-MRC: Graph (a) shows |B| (the number of bottleneck nodes) for which an optimal schedule 
can be found. Graph (b) shows optimal schedules for j/2 = 66 and varying node 3's position: for each j/3, the optimal schedule consists of probabilities of 
the four states, where the probabilities of the states take the height of their respective regions at yi [the probability of the state (T, T) is always zero]. 



positions yi = 0, ^4 = 100, and < 2/2,2/3 < 100. From ( |79| ) 
in the derivation of the optimal schedule in Appendix [D] we 
know that |S| = 1 has an optimal schedule only when node 2 
is further from node 1 than node 3 is from node 1, or 2/2 > J/3- 
From Fig. |3aj we see that \B\ =2 has no optimal schedule 
for these network topologies. In addition, if the nodes in the 
network are arranged such < 2/2 < J/3 < 100, then we only 
need to consider \B\ =3. 

Fig. |3b] shows optimal schedules for varying node 3's 
position (0,2/3), while fixing the rest of the nodes' positions 
at 2/1 = 0, 2/2 = 66, and 2/4 = 100. From Fig. 3a we 
see that for 2/2 = 66 and 20 < 2/3 < 53, p* is found in 
the case \B\ = 1, meaning that the optimal schedules are 
(pg, 1 — p'q,0,0). We can also see this from Fig. 3b that we 



only need to operate the network in two states, (L, T) and 
(L,L), for 20 < 2/3 < 53. This means node 2 does not 
need to transmit, and only node 3 needs to toggle between the 
transmitting and the listening modes. Meanwhile, for 2/2 — 66, 
< 2/3 < 20 or 53 < 2/3 < 100' three states are used in the 
optimal schedule. 

B. rSNR-degraded FD-MRCs and HD-MRCs 

Next, we compare DF rates for the Z3-node phase-fading 
FD-MRCs and HD-MRCs. We consider the line topology with 
two settings: 

i) the coordinate of node i is (i, 0); 

ii) the coordinate of node i is (^jztijO)- 

Note that the distance between the source and the destination 
increases in the first case, and remains fixed in the second 
case. 

We fix the per-symbol power constraints Pi = 10 for all i 
and the noise variance at the receivers Ni = 1 for all i. As 
these MRCs are rSNR-degraded, we can calculate DF rates 
for the half-duplex channels using Algorithm [3] 

Fig. [4] shows DF rates for these two networks as the 
number of relays increases. As expected, DF rates for the half- 
duplex network are upper-bounded by those for the full-duplex 
network \2j. The DF rates are significantly reduced in the half- 
duplex networks compared to the full-duplex counterparts as 
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3 
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1.5 
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Full Duplex, Vi = 
Half Duplex, = 
Full Duplex, V, =2{)i/(D-l 
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10 12 14 
Number of nodes, D 

Fig. 4. _Rdf vs D for phase-fading MRCs, where the coordinate of node i 

is {vi,0) 



the number of relays increases. In the first setting where all 
adjacent nodes are separated a unit distance apart, the full- 
duplex DF rate remains constant regardless of the network 
size (or equivalently, the source-destination distance) as the 
link from node 1 to node 2 is always the bottleneck. This is 
because any added node sees a higher SNR compared to its 
upstream nodes, and hence the reception rate of the added node 
is always higher This is not true when the nodes cannot listen 
and transmit simultaneously. As the relays do transmit all the 
time (otherwise their reception rate will be zero), the added 
node may not have a higher reception rate compared to its 
upstream nodes. We illustrate this by comparing two networks 
of D — 2 and D ~ 3. For D = 2, the source (node 1) always 
transmits and the destination (node 2) always listen. So, the DF 
rates for the HD-MRC, denoted by R^p[]j^2]' equals that for 
the FD-MRC, denoted by R[,p[jj^2]- Next, consider the FD- 
MRC with D = 3. The reception rate at node 2 (which can 
listen all the time) remains the same at R[)^u^2] - nodes 1 
and 2 always transmit, node 3 sees a strictly higher SNR, and 
has a strictly higher reception rate. So, the DF rate for D — 3, 
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which is constrained by the reception rates of nodes 2 and 3, 
is the same as that for D — 2, i.e., R^jp^^^^] — Rl)f:[D=2]- 
Now consider the HD-MRC with D = 3. The relay (node 
2) allocates its time to transmit and to listen, and hence its 
reception rate is lower than R^p[D=2]- ^ addition, node 3 
receives transmission from node 1 (which is further away) 
all the time, but from node 2 only a fraction of the time. 
From Algorithm [3] we know that node 2 picks the fraction 
of transmission time such that its reception rate equals that 
at node 3. Hence the DF rate for the HD-MRC is lower than 
that of the FD-MRC for D = 3, i.e., R^p^^^a] < ^df[d=3] 
This example illustrates a fundamental difference between the 
full-duplex and the half-duplex networks. 

VII. Reflections 

We have investigated achievable rates and optimal schedules 
for the decode-forward (DF) coding strategy for the half- 
duplex multiple-relay channel (HD-MRC). The code construc- 
tion in this paper differs from that of the traditional half-duplex 
network. Traditionally, a node transmits an entire codeword in 
one state duration. The transmission time for a codeword is 
shorter than the duration that the nodes stay in a particular 
state. In any state, data can only be sent to and from fixed 
sets of nodes. Thus, the order of the states becomes important 
as data need to be routed from the source to the destination 
through the relays. Hence, the order of the states and the time 
fraction of the states are parameters of the optimization. 

Inspired by the coding scheme for the half-duplex single- 
relay channel, where the codewords of the source and of the 
relay are split into two parts (one part when the relay listen, 
and another part when the relay transmits), we approached the 
scheduling problem in this perspective. Each codeword cycles 
through all states, meaning that the states change during one 
codeword duration and they are independent of how the data 
flow. The advantage of this approach is that the order of the 
states is not important as far as the rate is concerned and we 
have shown that only the time fraction (or the probability) of 
different states needs to be optimized. 

We have proposed an algorithm to solve a general class of 
maximin optimization problems. When applied to the phase- 
fading HD-MRC, the algorithm can be simplified. Further 
simplification to max optimizations has been shown to be 
possible for the rSNR-degraded phase-fading HD-MRC. The 
simplified algorithms have allowed us to obtain closed-form 
solutions for optimal schedules for certain network topologies. 

From the algorithms, we realized that the bottleneck nodes 
of the phase-fading HD-MRC are always the first B relays 
for some 1 < B < D — 2, or all the relays plus the 
destination. Furthermore, the bottleneck nodes for the rSNR- 
degraded phase-fading HD-MRC are always all the relays plus 
the destination, independent of the channel topology. This 
shows that in the half-duplex network, the transmit/listen state 
attempts to balance the transmission bottleneck of the half- 
duplex network. 

We have also discovered that, unlike the full-duplex case, 
where DF achieves the cut-set upper bound and hence the 
capacity when the relays are close to the source |2|, DF in the 
half duplex case does not achieve the cut-set bound. 



Appendix A 
Proof of Theorem[T] 

Consider the capacity upper bound of the half-duplex multi- 
terminal network |4, Corollary 2] with the following additional 
constraints due to the HD-MRC model considered in the paper: 

• We only consider the rate from node 1 to node D, which 
is R. 

• There are at most 2^^^ states, and the time fraction of 
the network in state s is p{s). 

We have the following upper bound on the maximum achiev- 
able rate R (i.e., the capacity C): 

C<supmin Pis)IiX,,XQ;YQ.,YD\XQ.,S), 

P(^) se{L,T}o-2 

(41) 

for some p{xi, X2, ■ ■ ■ , where the supremum is taken 

over all possible schedule p{s), and the minimization is taken 
over all possible Q C 72,. Here, Q'^ — TZ\Q, and TZ is the set 
of all relays. 

In state s, we have the following: 

• For Xq: ^Qnr(s) ^re transmitted, and ^gn£(s) = 
^Qn£(s) ^6 known to all nodes, 

• For Yqc: YQcn£(s) are received, and yg<:nT(s) = 
j/Q=nr(s) are known to all nodes. 

• For Xqc-. ^Qcn7-(s) are transmitted, and -'^gcn£(s) — 
^C'=n£(s) ^6 known to all nodes. 

Note that for four random variables A, B, C, and D, if B 
takes on a fixed value e, we have 

I{A, B = e; C\D) = I{B = e; C\D) + I{A; C\D, B ^ e) 
= I{A;C\D,B = e) (42) 

I{A; B = e, C\D) = I{A; B = e\D) + I{A; C\D, B ^ e) 
= I{A;C\D,B = e). (43) 

So the mutual information term on the right-hand 
side of (|4T]i becomes /(^i, Xgnr(s); >Q'^n£(s), 
^Q-'nTis), XQc(^jr(^s^ = a;g<:n£(s)> -'^Qn£(s) = S:Q(^jr{s), 
YQcnT(s) = j/Q-nr(s))- Rearranging the variables, we get 

^Q-nr(s) = yg-nrCs))- 

Under the half-duplex constraints, we also have 

p{xi,X2, ■ ■ . ,XD-l|s) 

= W 5{xk ^ Xk)p{xi,xr(s)\xc{s),s). (44) 
kec{s) 

Since we can choose any schedule p{s) and any input 
distribution X2, . . . , xd-i\s) of the form in ( |44] i, we get 
Theorem [TJ ■ 

Appendix B 
Proof of Theorem[3] 

As mentioned, the random coding proof for the FD-MRC |[T] 
Section IV.2] is adapted to our context of the HD-MRC. In ad- 
dition, we use the idea of splitting the codewords into different 
parts, one for each state |5|. We consider n transmissions or 
channel uses as a block and send (M — D + 2) messages over 
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M blocks. For any ri, we can choose a sufficiently large M 
such that the effective rate ^^(.^'^^^+^) ^^in be made as close 
to R as desired. As there are {D — 2) relays, the total number 
of states s is 2^^^. Let the states be Sq, Si, . .., S2d-2_i, 
and the probabilities of the states be po, pi, p2D-2_i 
respectively. First, we generate codebooks for the HD-MRC. 
Codebook generation: 

i) Fix, p{s), i.e., po, pi, p2D-2_i. We assume that 
rii = npi are integers. 

ii) Fix the states for all channel uses in each block as 
follows: for the first no — npo channel uses, the state 
is fixed at Sq, for the next ni — npi channel uses, the 
state is Si, and so on. 

iii) Fix the probability mass functions 

p{xi,X2, . . . , XD^l\s) = p{xd-i\s)p{xd-2\xd^1, S) 
■ ■ ■p{xi\x2,X3, . . . ,XD^1,S), (45) 

one for each state s. 

iv) For the p{xd-i\s) chosen in ( |45] l, generate 2"^ inde- 
pendently and identically distributed (i.i.d.) ?i-sequences 

G '^D-i according to 



(46) 



where n_i = 0, and Xi[t\ denotes the t-\h input 
from node i into the channel. Index the codewords as 

XD-i{wD-i), WD-i e {1,2,...,2"«}. 
v) Repeat the following steps for k = D — 2, D — 3, . . . ,1: 

a) For the p{xk\xk+i, Xk+2, ■ • • , xd-i,s) chosen in 
(|45|, and for each 

Xk+l:D-l{Wk+l,Wk+2, • ■ • , WD-l) 
^ (xk + l{wk+l\Wk+2, ■ ■ ■ ,WD-l), 
Xk+2iXk+2\wk+3, ■ ■ ■,WD^l), 
■ • ■ ,XD-liwD-l) 

generate 2"^ conditionally independent 
sequences Xk ^ XJ} according to 

P{Xk\Xk+l:D-l{Wk+l,Wk+2, ■ ■ -iWD^l)) 

= n n 

p(^Xk[t]^Xk+l:D-l{wk+l,Wk+2, ■ ■ . , 

where 

Xk+l-.D^liWk+l, Wk+2, ■ ■ ■ ,WD~l)[t] 
= {xk+l{Wk+l\Wk+2, ■ ■ ■ ^Wd^iM, 
Xk+2ixk+2\wk+3, ■ ■ . , WD-i)W, 
■ • ■ ,XD-l{wD-l)[t] 



Index the codewords Xk{'Wk\'Wk+i,Wk+2, 
...,WD-i), Wk e {1,2,...,2"«}. 



b) Repeat Stepjvja) with k ^ {k - 1). 
Encoding and Decoding: 

Let the message sequence be w(2), . . . , w{M — D + 

2). These messages will be sent in M blocks of transmissions. 
Let node fc's estimate of message w{m) be Wkim). 

i) In block m e [1,M], node k G [!,£>- 1] sends 
Xk{wk{Tn — k + l)\wk{m — k), . . . , Wk{'m— D + 2)) . We 
set wi{l) ~ w{l) for all I, and Wk{m) = 1 for to < 
and ni > M — D + 3. The transmission from node k in 
block TO depends on the messages that it has decoded 
in the past {D — k) blocks. 

ii) In block to £ [1,M], node k e {2,...,D} receives 

iii) At the end of block to, having already decoded w/c(l), 
Wk{2), . . . , Wk{m — k + l) in the previous blocks, node 
k declares Wk{m — k + 2) = w, if there exists a unique w 
such that the following is true for all e {0, 1, . . . , fc — 
2}. 



{ 



Xk-i-jiwlwkim - k + 1),. . . ,Wk{m 
Xk-jiwkim - fc + l)\wk{m - fc)), . . . 
3 



-j-D + 2)), 



Wk{m-j-D + 2)), 
■ ■ ■ ,XD-iiwk{m - j - D + 2),y^.{m - j), s| 
e . . . , X£)-i,Yk, S), 



(47) 



where s = (s[l], s[2], . . . , s[n]) is the vector 
of states for the block of n channel uses, and 
Ai^\Zi, Z2, ■ ■ ■ , Zm) is the set of e-typical n- 
sequences (2:1, 2:2, . . . ,zm) defined as follows. 

Definition 4: An e-typical n-sequences A^iP'\Zi,Z2^ 
.,Zm) with respect to a distribution p{zi, Z2, ■ ■ ■ , zi\i) 



on Zi X Z2 X 



-■M 



is the set of sequences 



(^1,^2 
1 



. . . , zm) e 2" X Z2 X • • • X such that 
\ogp{u) - H{U) 



< e, 



yu c{Zi,Z2,...,Zm}, 

(48) 



where p{u) = nr=i?'("W)- 

Probability of Error Analysis: We define the following: 



A. 



(to) = the event that no decoding error occurs at any 
node in the first to blocks, (49) 
= {all we {1,2,..., 2"^} that satisfy ^} , 



k-2 



Wfe(TO)^ fl Wfe,j(TO). 



(50) 
(51) 



Wkijn) is the set containing node fc's estimate(s) for the 
message w{m — fc + 2). This decoding step is done in block 

TO. 



Using a property of e-typical sequences |28 Theorem 
15.2.1], with sufficiently large n, we can bound the probability 
that the correct message ^(to— fc+2) is not in the set Wk,j{'ra) 
to be arbitrarily small, i.e., 

Pr{«;(TO - fc + 2) ^ Wfc,j (m)|Aeon-ect(m - 1)} < e. (52) 
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I{Xi,X2, ■ ■ ■ , Xk-i;Yk\Xk, Xk+i . . . , Xd-1, S) 

J2 pis)I{Xi,X2,...,Xk-i;Yk\Xk,Xk+i...,XD-i,S = s) (57a) 

se{so,si,...,s^o-i _i} 

pis)I{Xi,X2,...,Xk^i:Yk\Xk,Xk+u...,XD-uS = s) (57b) 

s6{so,Si,...,S2_D_l _ 

s.t. keC{s)u{D} 

= Y p{s)I{Xi,X[2,...,k-i}nTis);Yk\X[k,...,D-i}nTis)AXj ^ S:j : j £ C{s)},S = s). (57c) 

se{sa,si,...,S20-i _i} 
s.t. keC{s)u{D} 



Following p8] Theorem 15.2.3] with the substitutions 

5*1 = Xk-i-j{w'\ • • • ), 5*2 = Yk, S3 = {Xk-j, • • ■ , Xd~i), 

(53) 

for some w' w{m — fc + 2), and with a sufficiently large n, 
we can again bound the probability that the wrong message 
w' is in the set Wfe j(to) to be arbitrarily small, i.e., 

Pr{u/ e Wfc,j(TO)|Aco,Tect(TO - 1)} 

for all je {0,1,..., A: -2}. 

From ( |52l ), we can bound the probability that the correct 
message w{m — fc + 2) is not in node fc's set of message 
estimates to be arbitrarily small, i.e., 

Y>x{w{m - fc + 2) ^ Wfc(m)|AorTect(m - 1)} 

fc-2 

< Pr{w(m - fc + 2) ^ Wfej(m)|Aorrect("i - 1)} 

< (fc - l)e (55a) 
<{D- l)e. (55b) 



Eqn. ( |55a| i is true for any e > if a sufficiently large n is 
chosen. 

From ( |54] l, we can also bound the probability that any wrong 
message w' is in node fc's set of message estimates to be 
arbitrarily small, i.e., 

Pr |Any wrong message w' 7^ w{m — fc + 2) e Wkim] 

^correct ~ "'^'^l 

< ^ Pr \w' e Wk{m) AorTect(m - 1)| 



iii'G{l,2,...,2"'"} 
s.t. w ^w{ra~k+2) 



E 



Pr 



uj'e{l,2,...,2"'^} 
s.t. w' ^w{'rn — k+2) 



Wfcj(m),Vie [0,fc-2] 



^correct 

fc-2 



(56a) 



(56b) 



o'g{1,2,...,2"'^} J=0 
s.t. w' ^w{m— k+2) 



A,, 



(m — 1 



)} 

(56c) 



^ (^2"-R - ]^-)2-"5Zj;o(-f(^fc-i-j;'^fc|^fc~j,...,XD-i,S)-6e) 

(56d) 

^ ^2"-'^ _ l'^2^ri{I{Xi,...,Xk_i;Yk\Xk.Xk + i....,XD-i.S)~6ik-l)e) 

(56e) 

_^ 2-"[-f(^i''^2,...,Xi-_i;Ffc|Xfc,Xfc+i...,XD_i,S)-6(fc-l)e-i?] 

(56f) 

< ei, (56g) 



where ( |56d| i follows from ( |54] l. 

Eqn. ( 56g I is true for any ei > if a sufficiently large n is 
chosen and if the following condition is satisfied: 



R < I{Xi,X2, . . . , Xk-i;Yk\Xk, Xk+i ■ ■ 
-6(fc- l)e. 



■ , S) 



(58) 



where 6(fc — l)e — > as n increases. 

So, if n is sufficiently large and if ( |58] l is satisfied, com- 
bining ( |55b| i and ( |56g| l, the probability of node fc making a 
decoding error in block m, conditioned on the event that there 
was no decoding error in the previous (m — 1) blocks, can be 
made as small as desired, i.e., 

Pr |Node fc wrongly decodes w(m — fc + 2) in block m| 

Acorrect(m-l)} < p-l)e + ei. (59) 

Now, consider all the nodes and all the blocks. Using ( |59l ) 
and the same arguments as in the probability of any node 
making a decoding error in any block can be made as small 
as desired, for any chosen M and D, if n is sufficiently large 
and if ( [58] l is satisfied. 

The mutual information term in ( |58| l can be written as ( |57a[ ) 
shown at the top of the page. For nodes that transmits, i.e., j ^ 
C{s) U {£)}, we have Yj = yj and I{Xi,X2, . . . , Xj^i;Yj = 
yj\Xj,Xj-^-l,. . . ,Xd~i,S = s) = 0. This gives ( |57b| l. For 



nodes that listen, i.e., j e C{s), we have Xj 
(|57c 



This gives 



Since the result holds for any state and input distributions 
in the form p(a::i,X2, . . . ,a;_D_i,s) ^ p{s)llkec{s) ^^^i' ^ 
Xk)p{xi,xr(s)\xc{s), s), we get Theorem [3] ■ 

Appendix C 
Proof for Algorithm |2] 

In this section, we will prove that when applied to the 
phase-fading HD-MRC, Algorithm [T] can be simplified to 
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Algorithm |2] The proof consists of three parts, which are 
the differences between the two algorithms. Without loss of 
generality we assume that foi < 62 < • ■ ■ < bs- 

A. To prove that we only need to consider one set, i.e., 
{1,2,..., B}, for each B 

First, we consider Step [n] in Algorithm [T| for \B\ = 1. All 
possible cases for B are {1}, {2}, {K}. Suppose that 
an optimal solution exists for B — {j} where j 7^ 1 with an 
optimal p* where 

p* e arglmax i?j (p) , (60) 
p 

R,{p*)<Ru{p*), yk^j. (61) 

Recall that Rj{-) is the reception rate of node (j + 1). The 
optimal p* must have zero probability for all the states in 
which node {j + 1) transmits. Otherwise, we can decrease 
the probability of a state s' in which node (j + 1) transmits 
by some small amount e and increase the probability of 
the state s" by e (where s' and s" only differ in node 
(j + l)'s transmit/listen mode). Let the new schedule be p". 
The new schedule will have a higher reception rate for node 
(j + 1), Rj{-), by some small amount while maintaining 
Rj{p") < Rk{p"), for all k ^ j, since all {R^{p)'\ are 
continuous functions of p. So, in the optimal schedule, node 
(j + 1) always listens. 
In addition, suppose that p* has a non-zero probability, 
= Pa > 0, for state s' = (L, . . . ,L,. . .) in which 
both nodes 2 and (j + 1) listen, and define — pi,, 

where s" equals s' except for the first position, i.e., in state 
s" — (T, . . . , L, . . .), node 2 transmits and node (j+1) listens. 
We now show that p* cannot be an optimal schedule. Let p" be 
the same schedule as p* except that it has p{s') = Pa — ^ and 
p{s") = Pb+e, for some small positive e. Since j > 1, we have 
Rj{p") > Rj{p*). Also, since e is a small positive value, the 
inequality in ( |6T] ) is still maintained, i.e., Rj{p") < Rk{p"), 
for all k j. This means p* cannot be an optimal schedule. 
So, the optimal schedule p* for \B\ — 1 where B 7^ {1} must 
have zero probability for all states in which node 2 listens, 
i.e., node 2 always transmits in p*. This implies that if j ^ 1, 
i?i(p*) = 0, and condition ^6T\ cannot be satisfied for k = 1 
(contradiction). So, for \B\ = 1, we only need to check for 
B^{1}. 

Next, consider 2 < B < K — 1. If an optimal schedule 
exists, it must satisfies the following conditions: 

p* e arglmax{i?f,i (p) = Rb^ip) = ■ ■ ■ = Rb^ {p)}, (62) 
p 

RbAP*) < Rk{p*),yk B. (63) 

First, we show that we need to consider only schedules 
in which node (63 + 1) always listens. Otherwise, we can 
increase Rbsi') by reducing the probability of a state s', in 
which node (65 + 1) transmits, by a small amount and increase 
the probability of the state s" (which only differs from s' 
in which node (6^ + 1) listens) by the same amount. Doing 
this will not affect the reception rate of all upstream nodes, 
i.e., Rb^{-), Rb2{-), ■ ■ ■ , RbB-ii') (since we assume that bi < 



b2 < ■ ■ ■ < bs)- The reception rates of other nodes will only 
change by a small amount while maintaining ( |63| l. So, the new 
schedule will also achieve the same overall rate but with \B\ — 
B — 1, and we would have found this optimal schedule earlier 
in Step [n] in Algorithm [T] So, we can ignore the schedule in 
which node (6^ + 1) transmits with non-zero probability. 

Having established that node (6^ + 1) always listens, we 
now show that we only need to consider the case where B = 
{1,2,..., B}. Suppose that B ^ {1,2,..., B}. This means 
there exists some index i < B which is not in the set B and 
the last index in the set B must be strictly greater than B, 
i.e., bs > B. Since i ^ B, from ( |63] l, the reception rate of 
node {i + 1) must be strictly greater than R*, and this also 
means node (i + 1) listens with non-zero probability. Using the 
same argument as before, we can decrease Ri{-) by a small 
amount by decreasing the probability of the state s' in which 
node (i + 1) listens by a small amount e and increasing the 
probability of state s" (which only differs from s' in which 
node (j + 1) transmits) by e. Doing this will not affect the 
reception rate of all upstream nodes, i.e., {Rj{-) : 1 < i < 
j — 1}. However, the reception rates of all downstream nodes 
can either increase (if they Usten in s") or stay the same (if 
they transmit in s"). In particular, Rbgi-) will increase as node 
(6b + 1) always listens. This means using the new schedule, 
we can attain the optimal DF rate with fewer bottleneck nodes. 
We argued that we would have found this solution earlier in 
Step [n] in Algorithm [T] So, we can ignore this case. Up to this 
point, we have proven that for \B\ = B in Algorithm [T] we 
only need to consider B = {1, 2, . . . , B}. 



B. To prove that we only need to consider the global maximum 
in (|22]) 

Note that in the scheduling problem for the phase-fading 
HD-MRC, Ri (p) is a linear function of p (see ^^). So, the 
intersection of linear functions 

R{p) = Rb, ip) ^Rb,{p)^---^ Rb, [p) (64) 

is also a linear in p. Furthermore, the set of {p} that satisfied 
the aforementioned equation is a convex set. This means we 
can replace the local maximum function in (|22]) by the global 
maximum function in ( (33] l. 



C. R{B,p') in \23) , if exists, will not increase with a larger 
\B\ 

Next we will show that once a solution is found for ( |33| ) 
and ( (34] l in Algorithm [2j we can terminate the algorithm. 

We will prove this by contradiction. Suppose that there ex- 
ists a solution for ( [33] l and ( [34] i for some B = B', and suppose 
that we can get a better solution for some B = B" > B' . This 
is equivalent to saying that there exist some p',p" G G and 
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B' <B" <K such that 

p' £ argmax{i?i(p), i?2(p), • • • , Rb'{p)}, (65) 
p 

R' ^Ri{p')<R,{p'), yj>B\ (66) 

p" G argmax{i?i(p),i?2(p),...,i?i3'(p),...,i?i3"(p)}, 
p 

(67) 

R" = Ri{p") < R,ip"), yj>B", (68) 
where 

i?' < R". (69) 

Now, since 

i?" = i?i (p") = i?2 (p") = --- = Rb' ip") (70a) 
> i?' = = i?2(p') = • • ■ = Rb'{p'), (70b) 

eqn. ( |65| ) cannot be true as p' cannot be a global maximum 
for the set {1,2, . . . , B'} (contradiction). 

This completes the proof for Algorithm |2] ■ 

Appendix D 
The four-node Phase-Fading HD-MRC 

We let 

po ^ p{L, L),pi ^ p{L,T),p2 ^ p{T, L),p3 ^ p{T,T). 

(71) 

From Theorem [4j the optimal scheduling problem can be 
written as 

max min{r2 (p) , rg (p) , r4 (p) } , (72) 
peg 

where p = (po,Pi,P2,P3) is a schedule, = 
{{po,PuP2,P3) e 7^4 : ^^p, < l,p, > 0,Vi = 0,1,2,3} 
is the set of all feasible schedules, and 

7-2 = P0r(Ai,2Pl/iV2) + Pir(Ai,2Pl/iV2) (73) 
rg = Por(Ai,3-Pl/iV3) + P2r((Ai,3P3 + A2,3^'2)/VV3) (74) 
r4 = Por(Ai,4Pl/iV4) + pir((Ai,4Pl + A3,4P3)M) 
+ P2r((Ai,4Pl +A2,4P2)/A^4) 

+ P3r((Ai,4Pi + X2AP2 + X3aP3)/N4). (75) 

Using Algorithm [2] we start with B = {!}. We need to find 
some p* such that 



p* G maxr2(p), 
peQ 

r2{p*) < rsip*), 
r-2(p*) < r-4(p*). 



(76) 

(77) 
(78) 



The first condition means that p'^+pl = 1, and p'2 ^ P3 ~ 0. 
The second and the third conditions simplify to 



<pS<i, 



(79) 



r(Ai,2Pi/iV2) 
r(Ai,3Pi/iV3) 
n< . ^ r((Ai^4Pi + A3.4P3)M) - r(Ai.2Pi/Af2) 

- ^° ^ r((Ai^4Pl + A3,4P3)/iV4) - r(Ai.4Pl/iV4) ' ^ ' 

respectively. If we can find some p'q G [0, 1] satisfying ( |79l ) 
and (|80|, then the optimal schedule is given by p* — {pq, 1 — 

p5,o,o). 



If conditions ( |79| ) and ( [80| ) cannot be satisfied, we proceed 
to S = {1,2}. We need to find some p* such that 

p* e max{r2(p) = r3(p)}, (81) 

'^2(P*)='^3(P*)<'^4(P*). (82) 

Maximizing r2 (p) and (p) gives + P2 = 1 -Pi — 
pI = 0. Furthermore, r2(p*) = r3(p*) gives p*r j 



A1.3P1 
7V3 



Al,3P3+A2,3P2 



r 

.3-^3"!" '^2,3-^2 



Af3 

3P3+A2,3P2 



So, we have 



W3 



7V3 



Al,2-Pl 
Af2 



(^) 



P2 = 



(^)-r(^) 



-^1,3^3 + ^^2,3^2 



+ r(^)-r(^) 



(83) 



(84) 



The second condition (|82]l is thus equivalent to 



r 

< r 



Al,3P3 + A2,3P2 
Al,4P4 + A2,4P2 



Al,2Pl 

N2 

Al,2Pl 

No 



-r 



Al,4Pl 

N4 

Al,3Pl 
A^3 



(85) 



If condition ( (85| cannot be satisfied, we proceed to B = 
{1,2,3}. We need to find some p* such that 

p* G max{r2(p) = r3(p) = r4(p)}, (86) 
pes 

We have the following linearly independent equations: 

r-2(p*) =r3(p*) (87) 

r2(p*) -r4(p*) (88) 

Po+P*i+P*2+pI^^- (89) 

Solving the above, we can express pi, P2, P3 in terms of only 
Pq. Hence, the optimization becomes 



maxr2(po), 



(90) 



subject to < Pi < 1, for i = 0, 1, 2, 3. 

Summarizing all the above, we have the following for the 
four-node phase-fading HD-MRC: 
• If 

r(Ai,2Pl/iV2) r((Ai,4Pl + A3.4P3)/iV4) - r(Ai,2Pi/iV2) 
r(Ai,3Pl/iV3) r((Ai,4Pl + A3,4P3)/iV4) - r(Ai,4Pl/iV4) ' 

r((Ai,4Pi + A3^4P3)M) - r(Ai^2Pl/A^2) 



< 



r((Ai,4Pi + A3^4P3)M) - r(Ai,4PlM) ' 



and 



r(Ai,2Pi/iV2) 



< 1, 



r(Ai,3Pi/A^3) 

then an optimal schedule is given by some pg satisfying (|79 
and ([80|, pi ^ 1 - pg, p^ = 0, and pg = 0. 



• Else, if 



< 



r 



Al.3-P3+A2,3J^2 
N3 



Al.3P3+A2,3P2 
N3 



A1.2P1 
N2 



3 Pi 



< 1 



N3 
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and ([85| are both satisfied, an optimal schedule is given by 

pI = 0, and pi = 0. 
• Else, we solve for ( |90l ). Note that this is simpler than the 
original optimization problem ( |72j i. 
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